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Persistence.  Runs,  and  Recurrence  of  Sky  Cover 


1.  INTRODUCTION 

Duration,  persistence,  runs,  and  recurrence  are  all  interrelated.  For  this 
study,  they  have  been  defined  as  follows:  duration-continuous  successes; 
persistence-consecutive  successes  separated  by  one  hour;  runs-consecutive 
successes  separated  by  intervals  of  one  hour  beginning  and  ending  with  failure; 
and  recurrence-successes  occurring  at  time  t and  also  at  time  t+x  hours. 

This  study  is  part  of  a more  comprehensive  investigation  conducted  to  obtain 
a better  understanding  of  persistence,  runs,  and  recurrence  of  weather  events. 
Duration  could  not  be  studied  because  the  data  were  observed  at  hourly  intervals. 

Of  major  interest  are  those  weather  events  which  are  usually  recorded  in  categories, 
for  example;  precipitation  recorded  as  none,  light,  moderate,  or  heavy;  or  sky 
cover  recorded  as  clear,  scattered,  broken,  or  overcast.  Persistence,  runs,  and 
recurrence  of  precipitation  are  described  in  a paper  by  Lund  and  Grantham.  ^ 

This  report  includes  tables  of  observed  relative  frequencies  of  four  sky-cover 
categories  and  models  for  estimating  probabilities  of  each  category.  The  models 
provide  answers  to  such  questions  as;  What  is  the  probability  of  observing  a sequence 
of  more  than  five  hours  of  overcast  sky;  of  observing  a run  of  exactly  five  hours  of 

(Received  for  publication  20  December  197Y) 

1.  Lund,  LA.,  and  Grantham,  D.  D.  ( 19771  Persistence,  runs,  and  recurrence 
of  precipitation,  J.  Appl.  Meteor.  16:346-358. 
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overcast;  and,  of  observing  overcast  at  time  t and  also  at  time  t+5  hours?  The 
models  require  a knowledge  of  the  unconditional  probability  of  the  event,  in  this 
case  a sky-cover  category,  and  a measure  of  the  temporal  correlation  between 
occurrences  of  sky -cover  categories. 


2.  DATA 


Records  of  hourly  total  sky  cover  observations  taken  in  winter  (December, 
January,  February)  and  summer  (June.  July.  August)  during  the  13 -year  period 
1951  through  1963,  at  the  following  nine  stations,  shown  in  Figure  1,  were  studied; 
LGA  LaGuardia  Airport,  New  York,  NY, 

JFK  Kennedy  International  Airport,  New  York,  NY, 

EWR  Newark  Airport,  NJ, 

PHL  Philadelphia  International  Airport,  PA, 

BAL  Baltimore -Washington  International  Airport,  MD, 

DC  A National  Airport,  Washington,  DC, 

ADW  Andrews  AFB.  MD. 

RIC  Byrd  Field,  Richmond,  VA, 

RDU  Raleigh -Durham  Airport,  NC. 


Figure  1.  Location  of  the 
Nine  Stations  Whose  Winter 
and  Summer  Hourly  Observations 
of  Total  Sky  Cover  Were 
Studied 


Each  hour,  approximately  on  the  hour,  a weather  observer  at  each  of  the  above 

stations  went  outdoors  to  make  a regular  hourly  observation.  One  of  the  weather 

2 

elements  recorded  is  total  sky  cover.  The  Federal  Meteorological  Handbook 
describes  how  the  observations  are  taken.  The  four  sky -cover  categories  studied 
are  shown  in  Table  1.  The  abbreviations  LE  and  GE  stand  for  "less  than  or  equal  to" 
and  "greater  than  or  equal  to",  respectively.  The  two  ends  of  the  frequency  distri- 
bxition  were  studied  separately  to  determine  whether  temporal  correlation  is  a 
function  of  cloud  amount. 


Table  1.  Sky  Cover  Categories 


Category 

Total  Sky  Cover 

1 

0.  0 (clear) 

2 

LE  0.  3 (0.  0,  0.  1,  0.  2,  and  0.  3) 

3 

GE  0.  8 (0.  8,  0.  9,  and  1.  0) 

4 

1.  0 (overcast) 

3.  DATA  PROCESSING 

Each  hourly  sky  cover  observation  was  categorized  as  follows:  Zero-tenths 
(clear),  less  than  or  equal  to  (LE)  three-tenths,  greater  than  or  equal  to  (GE)  eight- 
tenths  and  ten-tenths  (overcast).  Some  of  the  stations  had  no  missing  observations, 
others  only  a very  few.  These  few  observations  were  filled  in  by  estimating  the  sky 
cover  from  observations  taken  at  nearby  stations  and  observations  taken  before  and 
after  the  missing  observations.  There  were  28,  080  [(24  observations /day)  X 
(90  days/season)  X (13  seasons)]  observations,  in  winter  and  28,  704  [(24  observa- 
tions/day)  X (92  days/season)  X (13  seasons)]  observations,  in  summer,  processed 
for  each  station. 

4.  PERSISTENCE 

4.1  Observed 

The  occurrence  of  a given  sky  cover  category  was  denoted  as  a success,  S,  and 

non-occurrence  as  a failure,  F.  The  relative  frequency  of  one  success,  RF(Sj),  is 

found  from  the  data  by  dividing  the  number  of  times  the  sky  cover  category  occurred, 

2.  U.S.  Department  of  Commerce  (1975)  Federal  Meteorological  Handbook  No.  1, 
Surface  Observations,  U.S.  Government  Printing  Orrice,  Washington,  D.  C. 

309  pp. 
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n(Sj)  by  the  sample  size  N.  The  relative  frequency  of  two  successes  in  a row 
RPiSg).  IS  found  from  the  data  by  dividing  the  number  of  times  a success  was 
followed  by  a success.  n(S^).  by  the  sample  size  N minus  the  end  effect,  in  this 
case  13.  because  there  were  13  years  when  the  next  season's  data  were  not  used 
to  determine  the  sky  cover  on  the  first  hour  of  the  next  season.  The  relative  fre- 
quency of  X successes  in  a row.  RF(S^).  is  found  by  dividing  the  number  of  times 
X consecutive  successes  was  observed.  n(S^).  by  the  sample  size.  N.  minus  the 
end  effects,  in  this  case  13(x-l), 

RF(S  ) 


n(S  ) 

X 


x'  N-15{x-1T  ' 


n(S  ) 

X 

N 


(1) 


This  processing  of  the  data  was  done  for  all  categories  for  all  nine  stations  in  both 
winter  and  summer. 

The  relative  frequency  of  a success  given  that  x consecutive  successes  have 

occurred.  RF(S  S^).  is  equal  to  the  relative  frequency  of  x+1  consecutive  successes. 

^^^^x+1^  divided  by  the  relative  frequency  of  x consecutive  successes.  RF(S  ) that 
is.  X * 


, RF(S 


(2) 


The  conditional  relative  frequencies  RF(s|s^)  were  computed  for  periods  up  to 
72  hours.  Selected  values  for  the  first  15  hours  are  shown  for  all  nine  stations  and 
all  four  sky  cover  categories  in  Tables  2.  3.  4.  and  5.  The  median  relative  fre- 
quencies are  indicated  with  asterisks. 

The  first  column  in  each  of  the  tables  gives  RF(s|s^)  which  is  defined  as  RFfS) 
the  unconditional  relative  frequency  of  the  given  sky  cover  category.  Although  both” 
the  unconditional  and  conditional  relative  frequencies  vary  from  station  to  station, 
there  is  often  no  consistent  pattern  to  the  variations.  It  was  subjectively  decided’to 
assume  that  the  data  from  all  stations  were  drawn  from  the  same  sample  and  to  use 
the  median  values  to  obtain  estimates  of  the  conditional  probabilities  P(s|s  ). 
required  for  obtaining  estimates  of  joint  probabilities.  P(S^^j).  that  is.  P(S  Sh 
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The  median  values  of  RF(sls^)  for  winter  and  summer  are  shown  in  Figure  2 
for  periods  up  to  15  hours.  The  median  relative  frequency,  RF(S),  of  the  most 
frequently  occurring  category,  GE  0.8,  was  0.5022  in  winter  and  0.4160  in  summer. 
Because  this  is  a frequently  occurring  category  there  were  many  long  sequences  of 
successes.  The  median  conditional  relative  frequencies  for  GE  0.8,  given  in  Table 
6 and  shown  as  x's  in  Figure  2,  increase  in  magnitude  for  13  hours  in  winter  and 
12  hours  in  summer.  They  never  vary  significantly,  that  is,  by  more  than  0.  015, 
for  the  next  few  hours  after  hour  15,  therefore  the  estimated  conditional  probabilities 
are  regarded  as  constant  after  hour  15.  The  conditional  relative  frequencies  of  the 
less  frequently  occurring  categories  are  more  variable  as  expected  but  they  never 
depart  from  the  hour  15  conditional  relative  frequencies  by  more  than  0.  069. 

Sample  relative  frequencies  of  x consecutive  successes,  RF(S^),  obtained  from 
the  data  sample,  are  given  in  Tables  7,  8,  9,  and  10.  All  of  the  relative  frequencies 
for  hours  1 through  70  are  shown  in  Figures  3,  4,  5,  and  6. 


Figure  2.  Relative  Frequencies 
of  Success,  Given  x Hours  of 
Consecutive  Successes  Have 
Occurred,  in  Winter  (a)  and 
Summer  (b).  The  curves  were 
subjectively  drawn 
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Figure  3.  Relative  Frequencies 
of  X Hours  of  Consecutive 
Successes,  in  Winter  (dots) 
and  in  Summer  (X's),  Where 
Zero-tenths  Sky  Cover  (clear) 
is  Regarded  as  a Success. 

The  solid  !ines  are  solutions 
to  Eqs.  (3)  and  (6)  for 
Winter  and  the  dashed  tines 
are  for  Summer  (see  text) 


Figure  4,  Relative  Frequencies 
of  X Hours  of  Consecutive 
Successes,  in  Winter  (dots) 
and  in  Summer  (X's),  When 
LE  Three -tenths  Sky  Cover 
is  Regarded  as  a Success. 

The  solid  lines  are  solutions 
to  Eqs.  (3)  and  (6)  for  Winter 
and  the  dashed  lines  are  for 
Summer  (see  text) 
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Figure  5,  Relative  Frequencies 
of  X Hours  of  Consecutive 
Successes,  in  Winter  (dots) 
and  in  Summer  (X's),  When 
GE  Eight -tenths  Sky  Cover 
is  Regarded  as  a Success. 

The  solid  lines  are  solutions 
to  Eqs.  (3)  and  (6)  for  Winter 
and  the  dashed  lines  are  for 
Summer  (see  text) 
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Figure  6.  Relative  Frequencies 
of  X Hours  of  Consecutive 
Successes,  in  Winter  (dots) 
and  in  Summer  (X's),  When 
Ten -tenths  Sky  Covt  r (overcast) 
is  Regarded  as  a Si>f:cess. 

The  solid  lines  are  b..!utions 
to  Eqs.  (3)  and  (6)  for  Winter 
and  the  dashed  lines  are  for 
Summer  (see  text) 
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4.2  Modeled 

The  probability  of  a sequence  of  x hours  of  successes  is  sometimes  estimated 
with  the  following  first  order  Markov  chain 

P(S^)  = P(S)  [ P(s|Sj)]*”^  (3) 

where  PiS^Sj^)  is  the  probability  of  a success  given  a success  has  occurred  and  x 
equals  the  number  of  hours. 

The  relative  frequencies  RF(S)  and  RF(s|Sj),  obtained  from  the  data,  are  given 
in  the  first  two  columns  in  Tables  2 through  5.  The  estimated  conditional  proba- 
bilities, P(s|Sj),  shown  in  Table  6,  were  used  to  test  Eq.  (3).  The  model  fit  the 
observed  values,  within  a few  percent,  for  the  first  few  hours  but  there  were  large 
differences  between  the  model  estimates  and  corresponding  sample  relative  fre- 
quencies when  probabilities  of  sequences  of  successes  longer  than  a few  hours  were 
estimated.  Figures  3 to  6 illustrate  differences  between  model  estimates  and  ob- 
served relative  frequencies  when  the  nine  station  median  relative  frequencies  are 
used  to  represent  the  sample  values.  These  figures  illustrate  the  failure  of  Eq,  (3) 
to  adequately  estimate  long  sequences  of  successes.  For  example,  note  the  large  i 

departures  of  the  first  order  Markov  model  estimates,  labelled  (3),  from  the  sample 
relative  frequencies  of  overcast  sky  cover  shown  in  Figure  6. 

To  improve  the  model  given  in  Eq.  (3)  the  following  axiomatic  expressions  can 
be  used  to  estimate  two  and  x hours  of  consecutive  successes,  respectively 

P'(S2)  = P(S)  P(s|Sj)  (4) 

P'(S^)  = P(S)  P(s|Sj).  . . P(sls^_j)  x>3  (5) 

where  P(S^Sj)  is  the  probability  of  a success  given  that  a success  occurred  the 
previous  hour  and  P(s|s^_j)  is  the  probability  of  a success  following  (x-1)  hours 
of  unbroken  successes. 

The  probabilities  required  for  the  solution  of  Eq.  (5)  were  estimated  from  the 
relative  frequencies  and  it  was  assumed  that  the  conditional  probabilities  were 
always  constant  beyond  15  hours.  To  estimate  joint  probabilities  Eq.  (5)  was  ex- 
pressed as  follows 

P'(S^)  = P(S)  P(s|Sj^_j)  X = 2 

* r(S)  P(s|Sj) p‘(s|Sj^_j)  3Sx«15 

P'(S^)  * P(S)  P(s|Sj) PisISj^)  [ X 2 16 
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Curves  were  drawn  for  the  points  in  Figure  2 and  estimates  of  P(sls  ) for  use 
in  Eq.  (6)  were  obtained  from  the  curves.  These  values  are  given  in  Table  6 in  the 
rows  labelled  P(s|s^).  The  winter  curves  in  Figure  2 follow  the  median  values  very 
closely  but  the  summer  curves  were  drawn  smoothly  without  following  as  near  to 
the  data  points.  The  departures  of  the  points  representing  the  clear  and  LE  three - 
tenths  categories  from  the  subjectively  drawn  curves  are  not  understood,  but  they 
may  be  related  to  the  diurnal  period  in  these  two  sky  cover  categories. 

Solutions  to  Eq.  (6)  are  shown  by  the  curves  in  Figures  3 to  6.  They  are  in 
better  agreement  with  the  relative  frequencies  than  curves  based  on  Eq.  (3).  This 
must  be  the  case,  because  the  probability  estimates  for  Eq.  (6)  are  based  more 
closely  on  the  relative  frequencies.  Modeling  is  involved  in  the  smoothing  of  the 
relative  frequencies  and  in  the  assumption  that  P(s|s^)is  constant  beyond  15  hours. 

Gringorten  simulated  probability  distributions  by  a Monte  Carlo  exercise  and 
prepared  charts  for  use  in  estimating  the  duration  of  weather  events.  These  charts 
were  used  to  estimate  conditional  probabilities  of  the  four  categories  of  total  sky 
cover.  Hour -to -hour  correlation  for  this  application  of  Gringorten 's  method  was 
assumed  to  be  0.  950  in  winter  and  0.  932  in  summer.  Table  6 shows  that  the  condi- 
tional probability  estimates  obtained  by  Gringorten' s method  are  in  generally  good 
agreement  with  the  observed  values.  One  shortcoming  of  the  method  is  that  it  is 
graphical.  It  is  difficult  to  estimate  the  probabilities  from  the  charts.  A promising 
analytical  method  described  in  the  treatise  by  Keilson  and  Ross'*  needs  further 
development  before  it  can  be  applied  to  this  problem.  In  the  absence  of  conditional 
relative  frequencies  obtained  from  large  data  samples,  Gringorten's  method  can 
provide  suitable  estimates  of  the  probabilities  required  for  the  solution  of  Eq.  (6). 
Because  a large  sample  of  data  was  available  for  this  study,  a smooth  subjective 
fit  to  the  relative  frequencies  was  used  to  estimate  the  required  probabilities. 

Table  11  summarizes  some  of  the  information  obtainable  from  Eq.  (6)  and  the  data 
points  shown  in  Figures  3 to  6.  It  shows  the  number  of  hours,  x,  that  each  of  the  four  sky 
cover  categories  was  estimated,  and  observed,  to  persist,  at  six  probability  levels  based 
on  P(Sj^)andRF(S^),  respectively.  For  example.  Table  11  shows  that  clear  (0.  0 sky  cover) 
has  a climatic  occurrence  probability  of  0.2634,  inwinter.  Therefore,  morethan50% 
of  the  time  clear  would  not  be  expected  and  a zero  is  entered  in  the  table.  However,  more 
than  25%  of  the  time  clear  is  expected  to  be  observed  for  at  least  one  hour.  Clear  is 
expected  to  persist  for  8,  14,  27,  and  46  hours;  10%,  5%,  1%,  andO.  1%  of  the  time  re- 
spectively, as  can  be  seen  from  the  solid  curve  based  on  Eq.  (6)  shown  in  Figure  3.  The 
corresponding  observed  values  are  shown  in  parentheses  in  the  table. 

3.  Gringorten,  1.  1.  (1966)  A stochastic  model  of  the  frequency  and  duration  of 

weather  events,  J.  Appl.  Meteor.  ^606-624. 

4.  Keilson,  J, , and  Ross,  H.  F,  (1975)  Passage  time  distributions  for  C^ussian 

Markov  (Ornstein-Uhlenbeck)  statistical  processes.  Selected  Tables  in 

Mathematical  Statistics  Vol.  Ill,  American  Mathemal'ical  bioclely, 

providence,  Rhode  Island,  pp  233-327. 
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The  values  given  in  Tables  11  and  12  are  for  east -coast  stations.  They  apply 
elsewhere  only  to  the  extent  that  the  probability  of  the  event  and  the  hour-to-hour 
correlation  is  the  same. 

Table  12  summarizes  some  of  the  information  obtainable  from  the  following 
equation. 


P(S^|S) 


p(sr 


(7) 


This  equation  is  used  to  estimate  the  conditional  probability  of  observing  a sequence 
of  X hours  of  a weather  category.  It  can  be  used  to  answer  questions  such  as; 
given  that  it  is  clear,  how  many  hours  will  it  be  before  there  is  a 50%  probability 
that  the  sequence  of  clear  skies  will  be  broken.  The  unconditional  probability,  P(S), 
is  assumed  to  be  known  and  P(S  Is)  must  equal  0.50.  Substituting,  for  example, 
the  winter  unconditional  probability  of  observing  clear,  0.  2634,  into  Eq.  (7),  it 
becomes 


(7a) 

P(Sx^l)  = 0.  1317  . 

(7b) 

Solutions  to  Eq,  (6)  for  P(S^^j)  for  clear  are  shown  in  Figure  3.  It  can  be  seen 
that  = 0.  1317  when  x+1  » 6 hours,  therefore,  clear  is  expected  to  persist 

for  more  than  5 hours  about  50%  of  the  time.  The  dots  in  Figure  3 show  values  of 
RF(S  ),  for  winter.  It  can  be  seen  that  RF(S  ) = 0.  1317  when  x=6  hours.  At  this 

X X 

point  the  model  and  the  data  are  in  good  agreement. 

Table  12  shows  that  from  Eq.  (71,  when  clear  is  observed,  it  is  expected  to 
persist  5,  11,  19,  24,  37,  and  56  hours;  50%,  25%,  10%,  5%,  1%,  and  0,  1%  of  the  time, 
respectively.  Corresponding  observed  values  are  shown  in  parentheses. 

The  following  equation  can  be  used  to  estimate  how  long  a sequence  of  successes 
is  expected  to  persist,  given  that  the  sequence  has  just  begun. 


P(S^|fS) 


P(FS^,j) 

P!py) 


(8) 


The  F preceeding  the  S denotes  a failure  followed  by  a success. 

Eq.  (8)  always  yields  smaller  values  than  Eq.  (7),  unless  the  process  is  first- 
order  Markov  in  which  case  the  values  are  identical.  A table  of  values  based  on 
solutions  to  Eq,  (8)  was  not  prepared,  but  sufficient  information  is  included  in  this 
report  to  prepare  such  a table. 
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5.  RUNS 

5. 1 Observed 

Another  way  of  examining  persistence  is  to  consider  the  number  of  runs  of 
exactly  x hours  in  length,  that  is,  n(FSjF),  n(FS2F). . . , n(FS^F).  The  relative 
frequency  of  runs  is  given  by  the  expression 

n(FS  F)  n(FS  F) 

RF(FS^F)=  ^ (9) 

where  n(FS^F)  is  the  observed  number  of  runs  of  exactly  x hours  in  length  and  N is 
the  total  number  of  hours  in  the  data  sample. 

The  observed  number  of  runs,  based  on  28.  080  hours  of  winter  observations, 
and  28,704  hours  of  summer  observations,  at  each  of  the  nine  stations  is  given,  for 
selected  hours,  in  Tables  13  to  16.  The  median  values  are  indicated  with  asterisks. 
Although  the  frequencies  are  based  on  more  than  28,000  observations  at  each  sta- 
tion and  season,  there  are  large  sampling  variations.  For  example,  in  Table  13, 
in  winter,  ADW  had  only  88  runs  of  three  hours  of  clear  while  the  nearby  station 
of  DCAhad  122,  LGA  had  only  4 runs  of  eighteen  hours  while  the  nearby  station  JFK 
had  14,  and  RDU  had  3 runs  of  24  hours  but  8 runs  of  36  hours. 

To  model  the  runs  it  is  assumed  that  one  good  model  can  estimate  runs  at  any 
of  the  nine  stations,  at  least  as  well  as  a 13 -year  data  sample. 

A model  was  considered  that  is  very  similar  to  Eq.  (6).  This  model  requires 
estimates  of  the  conditional  probabilities,  P(s|fS^).  Relative  frequencies  of  success 
given  a failure  and  x hours  of  successes  were  determined  from  the  data  with  the 
following  expression 


rf(s|fs  ) = 

X 


(10) 


These  relative  frequencies,  a selection  of  which  are  given  in  Tables  17  to  20,  were 
used  to  obtain  the  required  conditional  probabilities. 
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Table  14.  Observed  Number  of  Runs,  n(F^F),  of  x Hours  in  Length  Observed  in  the 
Sample  and  Estimated  Through  the  Use  of  Eq.  (14)  When  LE  Three-tenths  Sky  Cover  i 
Considered  a Success.  Median  values  are  identified  with  asterisks. 


r 


UJ 

SP 

-WTO 
-C  ^ 

<u  "S  J; 

•y  TO 

.S  c 

(U 

TO  ^ t; 

s=^ 

S|^ 


•»5 

s&l 

3 0) 

«-C  S 

o xi  ^ 
eu  0) 

|g2 

•6 

£ x:  . 

St-*  !S 

ZTO 
TJ  0) 
•U  fli  t) 

S 13  ^ 
>23 
C E w 

«'£  « 
TO  — 

«fi  M *0 

O w 0) 

■n 

•1-S 

<D  ^ 
^ 0,  TO 

«i  ao 

■S 

hJ?.2 


CO  * •«-  « 

CO  lO^^CD^NlOCD^ 


# # # 

o admoomooooc-om 

CO 


* 

C'j  <DOt^030ooeo»-'io 


co5f«-(coea«-HrHm^  c'jcg 


_ * * * 

'^'coio^^cgeoc^co  coco 


# # # # # 

mccmictnmc^r-co  icm 


00  ’M’  # # ‘i^  ^^  * 

^ ^O>O5^COCO00O5^  05CO  i0^iOlOOOOr>COcO  tfi  o 

04  ^^oioo4^05r-^  t^co  oonco^mu^o^o)  o^r- 

*H  ^ C*40)^04^'!<-^^^1-H  «-h«h 

^ <c  O^C4W^<00>040  OJCO  C4^£o0040)i!oiAP4  lOCO 

g mcDCOcotncocDcocd  coco 

£ 

^ ./^  00^00^&00^»^  OOO  |A«-^ 

V7  cocommcococoir^m  com  mcocooeooot-oo  con 

^ ^ 'It 

mcoocooicoom^  ooo  »^oojo>c'icoo>oc4  0505 

“ ooc»i'»oocof»ooo0oo  oor-  o^^oooojo^’*^  oo 

r"««— I 

O)0)^OOOCOOOO&  COC*4  m^COO<]CO<~<COCOC'»  004 

^ 9-1  ^ o)^t-Hino4cot<-m^ 


04000100^0)^0 

t«OOCOOOCOCOOOCOCO 


o)»^m^o*-<^t^o) 

C'4*^040)0)0)0400c0 

eoeococoo404coc4eo 


00 

00 

CO 

o 

04 

00 

CO 

CO 

O) 

CO 

CO 

CO 

CO 

04 

O 

o 

m 

CO 

tn 

CO 

CO 

CO 

04 

04 

04 

CO 

04 

04 

04 

04 

04 

04 

04 

O) 

m 

CO 

CO 

04 

•K- 

CO 

o- 

CO 

00 

04 

o 

04 

04 

rH 

o- 

CO 

o 

CO 

m 

CO 

CO 

Tf* 

m 

m 

m 

Ofc i Q 
(i<m< 


<:Quy  Q 

m<Q«CB 


Table  17.  Relative  Frequency  of  Success  Given  a Failure  and  x Hours  of  Success,  RFIsIfS^),  Obtained 
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The  probability  of  a run  of  exactly  x hours  in  length,  P(FS^T),  is  the  probability 
that  there  will  be  a failure  followed  by  x successes  followed  by  another  failure.  This 
might  be  estimated  as  follows,  for  runs  of  length  one,  two,  and  x hours,  respective- 
ly: 


P'(FSjF)  = P(F)  P(s!f)  P(f|fSj)  (11) 

P'iFSgF)  = P(F)  P(s|f)  P(sIfSj)  PiFjFSg)  (12) 

P'(FS^F)  = P(F)P(s|f)  P(sIfS^)  P(s|fS2)...P(s|fS^_j)  P(f|fS^)  (13) 

where  P(s|  F)  is  the  estimated  probability  of  a success  given  that  a failure  occurred 
the  previous  hour,  P(s|fSj)  is  the  estimated  probability  of  a success  given  that  a 

success  occurred  and  a failure  occurred  two  hours  earlier P(f|fS^)  is  the 

estimated  probability  of  a failure  given  that  x successes  occurred  the  previous  x 
consecutive  hours  and  a failure  occurred  x + 1 hours  earlier.  The  unconditional 
and  conditional  probabilities  can  be  estimated  from  the  relative  frequencies  but  very 
large  samples  of  data  are  required  to  obtain  statistically  stable  relative  frequencies 
of  long  runs,  because  they  are  rare  events. 

The  points  plotted  in  Figure  7 show  the  nine -station  median  relative  frequencies 
of  success  given  a failure  and  x hours  of  successes  have  occurred.  The  median 
values  are  given  in  Table  21.  Smooth  curves  were  subjectively  drawn  through  the 
points  in  Figure  7.  The  probabilities  required  for  the  solution  of  Eqs.  (11)  to  (13) 
were  estimated  from  these  curves. 

Table  21  shows  values  of  P(sl  FS^)  that  were  estimated  from  the  curves  shown 
in  Figure  7.  The  conditional  probabilities  always  increase  for  at  least  8 hours  and 
most  of  the  values  increase  for  at  least  11  hours. 

The  values  found  in  Table  21  were  used  to  solve  Eq.  (13).  By  substituting 
P'(FS^F)  from  Eq.  (13)  for  RF(FS^F)  in  Eq.  (9)  the  following  expression  is  obtained 
for  estimating  n(FS^F) 

n(FS^F)  = P'(FS^F)N  . (14) 

Solutions  to  Eq.  (14)  are  given  in  Tables  13  to  16.  The  agreements  between  the 
observed  number  of  runs  and  those  calculated  from  Eq.  (14)  are  very  good.  It 
should  be  understood  that  this  is  not  an  independent  test  of  Eq.  (14)  but  rather  a 
subjective  fitting  to  the  data  to  obtain  conditional  probabilities  and  an  objective 
metliod  for  finding  the  desired  probability  estimates. 
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Figure  7,  Relative  Frequencies  of  Success 
Given  Exactly  x Hours  of  Consecutive 
Success  Have  Occurred,  in  Winter  (a)  and 
Summer  (b).  The  curves  were  subjectively 
drawn 
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6.  RECURRENCE 

6.1  Observed 

The  relative  frequency  of  the  recurrence  of  a success  hours  later  given  that 
a success  occurred,  RF(S^|s),  can  be  determined  from  the  data  by  dividing  the 
number  of  occurrences  of  successes  spaced  hours  apart,  n(SS^),  by  the  total  num- 
ber of  successes,  n(S),  that  is, 

, n(SS«) 


i 

i 

i- 

I 


i 

I 

i 

i 

! 

! 


Conditional  recurrence  relative  frequencies  based  on  13  years  of  hourly  ob- 
servations taken  at  each  of  the  nine  stations  are  given,  for  selected  hours,  in 
Tables  22  to  25.  The  median  values  for  each  season  are  also  given  in  the  tables 
and  plotted  in  Figures  8 to  11.  Figure  8 shows  that  there  is  a pronounced  24-hourly 
period  in  clear  weather.  The  period  is  much  stronger  in  summer  than  in  winter. 
Figures  10  and  11  show  that  this  period  is  not  found  in  the  overcast  of  near  over- 
cast categories. 


Figure  8.  Relative  Frequencies 
of  a Success,  Zero -tenths  Sky 
Cover  (clear),  i Hours  Later 
Given  a Success  Has  Occurred. 
In  Winter,  (dots)  and  In 
Summer  (X's).  The  solid 
curve  is  the  solution  to  Eq. 

(17)  with  a'  = 0.  225  for  winter, 
and  the  dashed  curve  is  for 
summer  with  a'  = 0.  265 
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Figure  9.  Relative  Frequencies 
of  a Success.  LE  Three -tenths 
Sky  Cover,  Hours  Later  Given 
a Success  Has  Occurred,  in 
Winter  (dots)  and  in  Summer 
(X's).  The  solid  curve  is  the 
solution  to  Eq.  (17)  with 
a'  = 0.  235  for  winter,  and  the 
dashed  curve  is  for  summer 
with  a'  = 0.  262 
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Figure  10.  Relative  Frequencies 
of  a Success,  GE  Eight -tenths 
Sky  Cover  Hours  Later  Given 
a Success  Has  Occurred,  in 
Winter  (dots)  and  in  Summer 
(X's).  The  solid  curve  is  the 
solution  to  Eq.  (17)  with 
a'  = 0,226  for  winter,  and 
the  dashed  curve  is  for 
summer  with  a'  = 0,  235 
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6.2  Modeled 


K, 

i 


5 

McAllister  proposed  an  expression  of  the  form 

P(S^+^|s^)  = P(S^+jj)  + [l-P(S^)]  (16) 

for  estimating  recurrence  probabilities  of  cloud  cover.  He  used  a = 0,  263  and 

e 

b = 0.  632  as  the  best  estimates  of  the  parameters.  Gringorten  showed  that  Eq.  (16) 
yields  probability  estimates  very  close  to  those  obtained  from  the  bivariate  normal 
distribution  if  the  parameter  b is  fixed  at  0.  620  and  a is  allowed  to  vary  with  the 
climatic  frequency  of  the  event  and  the  basis  persistence  of  the  element. 

Eq.  (16)  was  modified  to;  (1)  eliminate  any  possibility  of  obtaining  probability 
estimates  greater  than  one;  (2)  take  into  account  diurnal  periods  in  weather  events 
and  (3)  obtain  climatic  estimates  independent  of  the  initial  hour.  The  new  equation 
was  expressed  as  follows 


i 


P(S^|S)  = 


1 


) (YZ)  + e 


m 


(17) 


where  YZ  is  the  temporal  average  of  the  product  of  the  two  probabilities,  that  is, 

23 

where  Y and  Z are  probabilities  of  success  at  time  t and  t+K  hours,  respectively; 
and,  W is  the  temporal  average  of  the  lower  of  each  pair  of  probabilities,  that  is, 

23 

W = E W.  (17b) 

i=0  ^ 

where  W.=Y.  or  Z.  . „ whichever  is  smaller, 
i t t+* 

Table  26  shows  that  there  is  a pronounced  diurnal  period  in  sky  cover  occur- 
rences. The  hourly  climatic  frequencies  of  the  events  were  substituted  into  Eq.  (17) 
using  i = 12  hours  and  b = 0.  620  and  the  equation  was  solved  to  find  the  parameter 
a'.  The  a'  values  are  given  in  Table  27. 

) 
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Table  26.  Nine -station  Median  Relative  Frequency  of  Each  Sky  Cover  Category  for 
Each  Hour  of  the  Day 


Hour 

(LST) 

0.0 

Winter 

Category 

LEO.  3 GEO.  8 

1.0 

0,  0 

Summer 

Category 

LEO.  3 GEO.  8 

1,  0 

00 

.351 

. 458 

.463 

. 399 

. 339 

.492 

. 362 

. 27  1 

01 

.361 

.450 

. 469 

. 422 

.362 

. 492 

. 379 

.272 

02 

.352 

.444 

.470 

.420 

.344 

. 503 

.377 

. 293 

03 

. 349 

. 439 

.480 

. 422 

. 350 

. 489 

.376 

. 296 

04 

. 344 

.438 

.480 

.426 

. 278 

, 454 

. 391 

, 300 

05 

.338 

. 439 

.490 

. 432 

. 168 

. 372 

. 451 

. 3 iS 

06 

.293 

. 407 

. 498 

. 427 

. 182 

. 352 

. 483 

. 334 

07 

. 204 

.338 

. 543 

. 439 

. 192 

. 367 

. 474 

, 342 

08 

. 193 

.319 

. 57  3 

.458 

. 204 

. 379 

. 451 

.315 

09 

. 191 

.326 

. 567 

.445 

. 202 

. 384 

. 428 

.291 

10 

. 186 

.335 

. 550 

.435 

. 175 

. 37  5 

. 420 

. 271 

11 

. 177 

,317 

. 543 

, 423 

. 126 

. 329 

. 431 

. 273 

12 

. 170 

.315 

.531 

.421 

. 093 

. 285 

. 443 

. 265 

13 

. 164 

. 321 

. 543 

. 422 

. 07  6 

. 256 

. 445 

. 259 

14 

. 168 

. 309 

. 537 

.426 

. 070 

. 252 

. 444 

. 247 

15 

. 171 

. 326 

. 532 

.420 

. 079 

. 259 

. 426 

. 237 

16 

. 181 

,337 

. 531 

. 408 

.095 

. 289 

.410 

. 249 

17 

. 197 

. 359 

. 503 

. 392 

. 117 

. 329 

. 421 

.257 

18 

. 245 

. 403 

.478 

.400 

. 153 

. 352 

. 423 

. 275 

19 

. 307 

.429 

.472 

. 397 

. 149 

, 350 

. 444 

. 280 

20 

.332 

. 438 

. 467 

. 405 

. 206 

. 387 

.410 

. 288 

21 

.338 

. 441 

. 462 

.407 

. 260 

. 443 

. 384 

. 283 

22 

.343 

.448 

. 466 

.413 

.316 

. 469 

. 368 

. 27  8 

23 

. 350 

. 447 

. 47  2 

.417 

. 325 

. 487 

.366 

. 279 

i 

i Table  27,  The  "a"'  Values  Used  in  Eq.  (17)  to  Find  the 

I Curves  Shown  in  Figures  8 to  11 


Season 

Clear 

LE  0.  3 

GE  0.  8 

Overcast 

Winter 

Summer 

0.  225 

0.  265 

0.  235 

0.  262 

0.  226 

0.  235 

0.  232 

0.  234 

Eq.  (17)  was  solved  for  lags  from  1 to  7 1 hours  using  the  a'  values  given  in 
Table  27.  The  resulting  curves  are  shown  in  Figures  8 to  11.  The  fits  to  the 
summer  relative  frequencies  are  excellent  for  all  sky  cover  categories.  The  fits 
to  the  winter  values  are  poor  after  12  hours. 
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KKMAKkS 


Relative  frequencies  of  persistence,  runs  and  recurrence  of  sky  cover  along 
the  east  coast  of  the  United  States  between  New  York  and  North  Carolina,  presented 
in  this  report,  are  based  on  more  than  250,000  hourly  observations  taken  in  winter 
and  a similar  number  taken  in  summer.  They  are  believed  to  be  good  approximations 
of  the  true  probabilities. 

Models  are  presented  for  use  in  estimating  joint  and  conditional  probabilities. 

The  estimates  are  usually  in  good  agreement  with  the  relative  frequencies  when  the 
parameters  are  carefully  chosen.  However,  the  best  parameters  for  the  Central 
East  Coast  area  of  the  United  States  may  not  be  the  best  for  other  geographical 
areas.  Future  studies  will  be  extended  to  other  areas  and  to  improving  the  models. 
Other  weather  elements  are  under  study  at  the  present  time. 
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